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Summary 

The bat coronavirus HKU4 belongs to the same 2c lineage as that of the deadly Middle East 
Respiratory Syndrome coronavirus (MERS-CoV) and shows high sequence similarity, therefore 
potentiating a threat to the human population through a zoonotic shift or “spill over’ event. To date, 
there are no effective vaccines»or antiviral treatments available that are capable of limiting the 
pathogenesis of any human coronaviral infection. An attractive target for the development of anti- 
coronaviral therapeutics is the 3C-like protease (8CL””), which is essential for the progression of the 
coronaviral life cycle..Herein, we report the screening results of a small, 230-member peptidomimetic 
library against HKU4-CoV 3CL"° and the identification of 43 peptidomimetic compounds showing 
good to excellent inhibitory potency of HKU4-CoV 3CL™ with IC,, values ranging from low micromolar 
to sub-micromolar. We established structure-activity relationships (SARs) describing the important 
ligand-based features required for potent HKU4-CoV 3CL°° inhibition and identified a seemingly 
favored peptidic backbone for HKU4-CoV 3CL”™ inhibition. To investigate this, a molecular sub- 
structural analysis of the most potent HKU4-CoV 3CL”® inhibitor was accomplished by the synthesis 
and testing of the lead peptidomimetic inhibitor’s sub-structural components, confirming the activity of 
the favored backbone (22A) identified via SAR analysis. In order to elucidate the structural reasons 
for such potent HKU4-CoV 3CL”° inhibition by the peptidomimetics having the 22A backbone, we 
determined the X-ray structures of HKU4-CoV 3CL”™ in complex with three peptidomimetic inhibitors. 


Sequence alignment of HKU4-CoV 3CL"°, and two other lineage C Betacoronaviruses 3CL"’s, 
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HKU5-CoV and MERS-CoV 3CL°°, show that the active site residues of HKU4-CoV 3CL” that 


participate in inhibitor binding are conserved in HKU5-CoV and MERS-CoV 3CL°®. Furthermore, we 
assayed our most potent HKU4-CoV 3CL™ inhibitor for inhibition of HKU5-CoV 3CL°° and found it to 
have sub-micromolar inhibitory activity (IC,,= 0.54 + 0.03 uM). The X-ray structures and SAR 
analysis reveal critical insights into the structure and inhibition of HKU4-CoV 3CL°”, providing 
fundamental knowledge that may be exploited in the development of anti-coronaviral therapeutics for 


coronaviruses emerging from zoonotic reservoirs. 


1. Introduction 

Zoonotic viruses, including coronaviruses, Ebola virus, HIV, influenza A viruses, hantaviruses 
or henipaviruses, are ubiquitous and can emerge as significant human pathogens capable of global 
epidemics and pandemics. Most often, we spend our time and resources in targeting only the human 
pathogens with small molecule inhibitors with the ultimate goal of developing therapeutic compounds 
to treat the associated diseases. A missed opportunity, however, is targeting the zoonotic host with 
small molecule inhibitors with the goal of utilizing the acquired chemical and structural knowledge to 
inform us of the evolutionary path of the virus on a biological basis. So, we believe that by using 
small molecule compounds as probes of the structural evolution of viral enzyme drug targets, from 
the zoonotic reservoirs to the human pathogens, we can gain new insights and predict a priori the 
structural scaffolds of small molecule compounds that can serve as lead templates for therapeutic 
development against emerging human pathogenic viruses such as coronaviruses. 

Coronaviruses (CoVs) are enveloped, single-stranded, positive-sense RNA viruses that infect 
and cause disease in a variety of species including bats, birds, cats, dogs, pigs, mice, horses, 
whales, and humans.’ Goronaviral infections may range from mild to severe and can result in 
respiratory, enteric, hepatic, or neurological diseases in their carriers. The first two human CoV 
strains (HCoV-229E and HCoV-OC43) were identified in the mid-1960s, and it wasn't until the 21° 
century that.a.new human coronavirus (Severe Acute Respiratory Syndrome or SARS-CoV) was 
identified.*° Currently, there are at least six known human CoVs including: HCoV-229E, HCoV-NL63, 
HCoV-0C43, HCoV-HKU1, SARS-CoV, and most recently, the Middle East respiratory syndrome 
coronavirus (MERS-CoV).° MERS-CoV, formerly known as HCoV-EMC, was identified in November 
2012, when it was isolated from the sputum a 60-year-old Saudi Arabian man presenting acute 
pneumonia and renal failure.’ Since the time of its identification, the virus has grown to be a threat to 
public health worldwide having a case-fatality rate of about 30%.° To date, there are no vaccines or 


antiviral agents capable of preventing or treating any human coronaviral infection. 


3 
MERS-CoV belongs to lineage C in the genus Betacoronavirus of the Coronaviridae family in 


the Nidovirales order.’ Also of this lineage are the species Tylonycteris bat coronavirus HKU4 
(HKU4-CoV) and Pipistrellus bat coronavirus HKU5 (HKU5-CoV), where the overall amino acid 
sequence identities of MERS-CoV to HKU4-CoV and HKU5-CoV across the conserved domains are 
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approximately 75% and 76.7%, respectively. Though the exact origin of MERS-CoV is currently 
debated, a bat origin is strongly suspected as MERS-CoV is so closely related to HKU4- and/HKU5- 
CoV and because MERS-CoV genomic RNA has been found in bats and dromedary camels in 
Qatar.'’*"* Though HKU4-CoV and HKU5-CoV have been found only in bats, studies have shown that 
their accessory proteins are capable of inhibiting human antiviral signaling pathways in vitro."""° This, 
and the close similarity of MERS-CoV to HKU4-CoV and HKU5-CoV, suggests that a zoonotic shift 
from bats or camels to humans may have occurred."® A recent investigation into the interactions 
between the human CD26 receptor and the receptor binding domains (RBDs) in the MERS-CoV, 
HKU4-CoV and HKU5-CoV envelope-embedded spike protein revealed that MERS-CoV and HKU4- 
CoV both engage this receptor for viral entry whereas HKU5-CoV does not."” These observations 
suggest that an evolutionary pathway from bat HKU4-CoV to human MERS-CoV exists and that 
investigating the molecular basis of this zoonotic. shift from a structural and chemical-biology 
perspective may allow us to predict and target the these viruses with small molecule therapeutics. 

Coronaviral genomes are polycistronic, encoding for two large polyproteins, pp1a and pptab.'” 
* Initiation of coronavirus replication. in cells occurs by the translation of two overlapping, open 
reading frames (ORF1a and ORF1b) to produce ppia and, following a -1 ribosomal frameshift 
mechanism, ppiab. These polyproteins are then proteolytically processed at 14 cleavage sites by 
two essential viral cysteine proteases, the papain-like protease (PL’®, or nsp3) and the 3C-like 
protease (3CL"°, also known as the main protease, M’°, or nsp5). Cleavage by both proteases 
results in the production of 16 nonstructural proteins (nsps), where PL”” is responsible for cleavage at 
3 sites and 3GL”° is responsible for cleavage at 11 sites (Figure 1). The function of 3CL”™ is vital for 
the coronaviral life cycle, making it an attractive target for the development of antiviral drugs.”"” 

The present work was undertaken to investigate the kinetic and structural properties of HKU4- 
CoV 3CL™ and to utilize this knowledge to discover and develop potent inhibitors of HKU4-CoV 
3CL”". Targeting the immediate zoonotic reservoirs of coronaviruses with small molecule inhibitors 
can help inform structure-based design strategies aimed at creating molecular scaffolds that may also 
target the emerging human CoVs and ultimately aid in the development of therapeutics against 
coronaviral infection. Towards this goal, we first expressed, purified, and characterized the kinetic 
properties of HKU4-CoV 3CL"’. We then determined the inhibition of HKU4-CoV 3CLpro by a small 


library of 230 peptidomimetic compounds, which resulted in the identification of 43 HKU4-CoV 
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3CLpro inhibitors, two of which showed sub-micromolar potency. With a potent lead inhibitor 


compound in hand, we investigated the contributions of its individual sub-structural components to 
inhibitory potency and identified a favored peptidic backbone for HKU4-CoV 3CL*” inhibition. Finally, 
we determined the X-ray crystal structures of the three most potent inhibitors containing this favored 
backbone in complex with HKU4-CoV 3CL”™ and elucidated the structural reasons behind such potent 
HKU4-CoV 3CL*” inhibition. 


2. Results and Discussion 
2.1. Screen of peptidomimetic library for HKU4-CoV 3CL”” inhibition 


A library of 230 previously reported, peptidomimetic-type compounds 


*°24 was screened for 


inhibition of HKU4-CoV 3CL”° at a single concentration of 100 uM, and IC,, values were determined 
for compounds that showed greater than 50% inhibition of HKU4-CoV 3CL*° at that concentration. 
This screen resulted in the identification of 43 peptidomimetic compounds with inhibitory 
concentrations ranging from micromolar to sub-micromolar (vide infra). The peptidomimetic inhibitors 
can be grouped into two classes, inhibitors with a single amide bond in the inhibitor backbone (Class 
A) and inhibitors with a dipeptide-like backbone (Class B). The results of this screen are displayed in 
Table 1a (Class A) and Table 1b (Class B), where only the compounds that showed greater than 50% 
inhibition of HKU4-CoV 3CL"° at 100 wM are shown (see Supporting Information S1 and S2 for 
comprehensive list of compounds tested).. The compounds are listed in the tables in rank order by 
IC,, values starting with the lowest IC,, value. Of the 25 Class A inhibitors that were tested for 
inhibition of HKU4-CoV 3GL"’, 19 were found to inhibit HKU4-CoV 3CL”° above 50% at a 
concentration of 100 uM, and subsequently, IC,, values were determined. Remarkably, the two most 
potent inhibitors, 1A and,2A, proved to be sub-micromolar inhibitors of HKU4-CoV 3CL°”, having 
IC's of 0.33 + 0.02 and 0.41 + 0.04 uM, respectively. Upon examination of all 27 Class A inhibitors, 
an immediate trend is apparent where there is an overwhelming preference for a 3-thiophene at the 
R, position and a 1-methylbenzotriazole at the R, position of the peptidomimetic backbone. This is 
observed in every Class A inhibitor found to produce inhibition of HKU4-CoV 3CL”™° over 50% at 100 
uM, with the exception of the least active analogue in this series, 19A. 19A only differs from the more 
active compound 6A in that 19A has a triazole in place of a benzotriazole; however, this difference is 
significant enough to decrease inhibitory potency ten-fold between the analogues, from 1.6 + 0.1 uM 
for 6A to 16.0 + 4.0 uM for 19A. Interestingly, 19A maintains a triazole ring in the R, position, 
indicating that the position of the nitrogen atoms in this heterocycle is essential for good HKU4-CoV 
3CL™ inhibition. Other R, heterocycles incorporated in the less active members of Class A include 4- 
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substituted methyltriazoles, 4-imidazoles, and 2-furans, underscoring the importance of the 


methylbenzotriazole at the R, position and pointing to the potential significance of a precisely oriented 
hydrogen bond acceptor (see Supporting information S1 for a comprehensive list). 

Of the 19 Class A peptidomimetics found to be good inhibitors of HKU4-CoV 3CL°", 13 have 
amide substituents at R,, four have aromatics (5A, 7A, 9A, and 11A), and two have amines (16A and 
17A). Among the inhibitors with amide substituents at R,, a trend can be observed.where cyclic 
carboxamides are preferred (1A, 2A, 3A, 4A, and 8A) to alkyl carboxamides, with the aromatic 
carboxamides being the most active (1A and 2A). A preference for larger cyclic carboxamides is also 
observed, where 4, 5 or 6 membered cycles (1A, 2A, 3A, and 4A) are preferred over the smaller 
cyclopropylcarboxamide (8A). 

Among the six Class A inhibitors with aromatic groups substituted at the R, position (5A, 7A, 
9A, 11A, 20A, 21A), only the four compounds 5A, 7A, 9A and.11A were found to inhibit HKU4-CoV 
3CL*” above 50% at 100 uM and they each contain aromatic nitrogen heterocycles. The position of 
the nitrogen is of moderate significance, as a p-substituted pyridine at R, is 2-fold more active than 
the m-substituted pyridine (5A vs. 11A, Table 1a). Bulky substituents on R, aromatic heterocycles, 
such as in 20A, decrease inhibition of HKU4-CoV 3CL””, indicating a steric requirement within the R, 
binding pocket (Table 1a and S1). Interestingly, 20A is one of only two Class A compound having a 
3-thiophene at the R, position and a.1-methylbenzotriazole at the R, position that is not a good 
inhibitor of HKU4-CoV 3CLpro at 100 pM. Addition of a methoxy substituent to the R, heterocycle is 
tolerated (7A and 9A, Table 1a), but does not increase or decrease inhibition significantly, indicating 
that the addition of another hydrogen-bond acceptor does not further engage the R, binding pocket 
and that smaller aromatic substituents can be tolerated. 

The two Class A peptidomimetics that have R, amines, 16A and 17A, have similar IC,, values 
(4.8 + 0.4 and 5.3 + 0.6 uM, respectively); however, they have vastly different steric requirements 
(methyl vs. benzyl). As opposed to the R, amides and aromatics, this may indicate that when an 
amine substituent is present at the R, position, it dictates an alternative binding orientation within the 
R, binding space that does not lead to such strict steric requirements. Alternatively, the relatively 
hydrophobic benzyl group of 17A may decrease inhibitor solubility and consequent inhibitor 
availability, therefore resulting in lower % maximum inhibition and IC,, value than would otherwise be 
observed. This may also be the case for 21A, where the R, aromatic ring lacks the nitrogen found in 
the more active inhibitors 5A and 11A (Table 1a and S1). 

A comparison of 17A to the second most active inhibitor in Class A, 2A, which is only different 


from 17A by the presence of a carbonyl, shows that the carbonyl of 2A is crucial for sub-micromolar 


6 
inhibitory potency (0.41 + 0.04 uM for 2A vs. 5.3 + 0.6 uM for 17A). This observed 10-fold increase 


in IC,, may be a consequence of the loss of a stabilizing hydrogen-bond interaction between the 
amide —NH group or carbonyl oxygen of the inhibitor and the HKU4-CoV 3CL™ binding site. The 
change in hybridization between an amine and an amide also results in a change in the molecular 
geometry of the R, substituent, which may alter hydrogen-bonding properties and introduce additional 
effects that may be important factors in dictating good enzymatic inhibition. The X-ray. structure of 
HKU4-CoV 3CL”° in complex with 2A, discussed below, shows that the amide —NH group of the 
carboxamide group of 2A forms an important interaction with the backbone carbonyl of His41. 

Of the 205 Class B inhibitors that were tested for inhibition of HKU4-CoV 3CL”, 24 were found 
to inhibit HKU4-CoV 3CL”" above 50% at a concentration of 100 uM, and subsequently, IC,, values 
were determined (Table 1b). Interestingly, we found fewer good inhibitors and no sub-micromolar 
inhibitors of HKU4-CoV 3CL”° among this larger, more substituted dipeptide-like class of compounds 
(~76% of compounds in Class A (19 of 25 compounds tested) were good inhibitors of HKU4-CoV 
3CL” while only ~12% of compounds in Class B (24 of 205 compounds tested) were found to be 
good inhibitors). This result could be a consequence of the increased steric bulk perturbing the 
preferred binding orientation of the Class B compounds in the HKU4-CoV 3CL”” binding site. A 
comparison of similar Class A and Class B inhibitors more clearly elucidates this trend, where the 
addition of the sterically bulky R, #Bu amide in the Class B compounds decreases HKU4-CoV 3CL”° 
inhibition in each instance (S3). Forexample, the most active Class A inhibitor, 1A, having and IC, of 
0.33 + 0.02 uM and 93% inhibition at 100 uM, is almost completely inactivated by the addition of the t 
Bu amide as seen in the Class B compound 25B, with only 26% inhibition at 100 uM. ~—_ Within the 
Class B inhibitors, there is a preference for the 3-thiophene at the R, position and a 1- 
methylbenzotriazole atthe R, position of the peptidomimetic backbone, though it is not as strong as 
that observed in within Class A (25% of compounds in Class B vs. 95% of compounds in Class A). 
Among the series where R,= 3-thiophene and R, = 1-methylbenzotriazole (compounds 3B, 6B, 9B, 
16B, 18B, and 21B) small carboxamide groups at the R, position are favored as observed by the 
comparison of 3B to 28B and 6B to 27B (see Tables 2 and 3). Possibly, Class B inhibitors with small 
R, amides allow for a shift in the position of the inhibitor in the HKU4-CoV 3CL” binding site to 
accommodate the bulky t-Bu amide at the R, position. Again, an R, amide is preferred to an amine 
for good inhibition of HKU4-CoV 3CL” (IC,,= 1.8 + 0.5 uM for 3B vs. 22.0 + 7.4 uM for 21B). 
Replacement of the benzotriazole within this series with a benzimidazole has dissimilar effects; for 
example, comparing 19B (Table 1b) to 28B (S2 and S3) where the benzimidazole of 19B has been 


replaced with a benzotriazole, inhibition decreases from 68% to 34% at 100 uM, respectively. In 
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7 
contrast, comparison of 23B to 18B shows that substitution of benzimidazole for benzotriazole 


increases inhibition from 56% to 76% at 100 uM (Table 1b). Interestingly, only Class B compounds 
having a para-substituted phenyl ring with an R, substituent were found to be good inhibitors of 
HKU4-CoV 3CL"°; compounds with other, non-phenyl rings including cyclopropyl, cyclohexyl, 


adamantane, and bicycloheptane were not found to be active inhibitors. 


2.2. 1A-RFL Sub-structural activity analysis via component synthesis 

In an attempt to delineate the structural relationships that contribute to the enhanced activity of 
our most active inhibitor, 1A, we accomplished the synthesis of the sub-structural components of 1A 
and determined their individual inhibitory activities against HKU4-CoV. 3CL°"° (Scheme 1). The 
straightforward synthesis of these sub-structural 1A components; 22A, 23A, and 24A, was 
accomplished from the commercially available starting materials, benzotriazole and p- 
phenylenediamine. The synthesis of 22A was achieved. in two steps. First, alkylation of 
benzotriazole by reaction with chloroacetic acid and sodium hydroxide yielded the benzotriazoleacetic 
acid 22 in 50% yield.” | Subsequent coupling. of 22 with the commercially available 
thiophenylmethanamine using EDC produced the desired 1A component 22A in 25% yield (Scheme 
1, top). The synthesis of 23A and 24A was accomplished linearly by first coupling p- 
phenylenediamine with the commercially available thiophenecarboxylic acid using HATU, which 
resulted in the desired 1A component 23A in 58% yield. Reductive amination of 23A and 
commercially available thiophenecarbaldehyde using sodium triacetoxyborohydride produced the 
desired 1A sub-structural component 24A in 81% yield. Finally, 1A was resynthesized according to 
the literature procedure by coupling 24A with 22 using EDC as a coupling reagent.” 

The sub-structural components 22A, 23A, and 24A were then individually tested for inhibition 
of HKU4-CoV 3CL”° at a concentration of 100 uM. The 22A component, representing the R, and R, 
substituents of the Class A peptidomimetic backbone, showed 28% inhibition of HKU4-CoV 3CL”” at 
100 uM.and had an IC,,of 68 uM, supporting our hypothesis that this backbone has some inhibitory 
capacity even in the absence of the R, component. The 23A sub-structural component, representing 
the R, substituent of the Class A peptidomimetic backbone, was found to have no inhibitory activity 
against HKU4-CoV 3CL™ at 100 uM, supporting R, and R, as essential substituents. Additionally, the 
24A sub-structural component, representing the R,and R,substituents of the Class A peptidomimetic 
backbone, showed no inhibition of HKU4-CoV 3CL”™ at 100 uM, indicating the crucial nature of the 1- 
methylbenzotriazole at the R, position of the Class A backbone and supporting our previous 
observation of the need for a precisely oriented hydrogen bond acceptor at this position within the 
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8 
Class A peptidomimetics (vide supra). This finding establishes 22A as a minimum component 


necessary for inhibition of HKU4-CoV 3CL°° and suggests it may be used as a scaffold for future 
fragment based design of HKU4-CoV 3CL”” inhibitors. Interestingly, the work of Wong and coworkers 
showed that a series of benzotriazole esters, which act as potent, covalent inhibitors of SARS-3CL"”, 
are also able to inhibit SARS-3CL"° upon replacement of the ester oxygen with carbon, albeit 
weakly.” Taken together, these findings may indicate an inhibitory preference for benzotriazoles 
among coronaviral 3C-like proteases. 


2.3. X-ray crystallographic analysis of HKU4-CoV 3CL”” in complex with. 1A, 2A, and 3B. 

In order to more fully elucidate the structural factors underlying the structure-activity 
relationships (SARs) observed in the kinetic data, we determined the X-ray crystal structures of 
HKU4-CoV 3CL*° in complex with three of the most potent inhibitors, 1A, 2A, 3B and the favored 
backbone 22A. These complexes were chosen to determine the potential structural explanations 
behind our observations that Class A and B inhibitor backbones, containing a 3-thiophene and 1- 


pro 


methylbenzotriazole, are favored for HKU4-CoV 3CL™ inhibition. We also sought to determine the 
structural reasons behind the preference for Inhibition by Class A compounds over Class B 
compounds. 

Co-crystallization of 1A, 2A, 3B, and 22A with HKU4-CoV 3CL”™ produced high-quality crystals 
suitable for X-ray data collection and structure determination. Complete X-ray data sets were 
collected on these four HKU4-CoV 3CL”°-inhibitor complexes to resolutions between 1.8 A and 2.3 A. 
A summary of the data collection and refinement statistics are provided in S4. 

The X-ray crystal structures confirm that both the Class A (1A and 2A) and B (3B) 
peptidomimetics bind in the active site pocket of HKU4-Co 3CL””, which is lined primarily with polar 
residues and is solvent exposed (Figure 2). In each of the HKU4-CoV 3CL”*-inhibitor complexes, the 
carbonyl carbon of the central inhibitor backbone, i.e preceding the R, group, is located between 4.5- 
4.8 A from the’sulfur atom of the catalytic cysteine (Cys148, Figure 2, A, C, E). Both 1A and 2A, 
which have a 3-thiophene at the R, position and a 1-methylbenzotriazole at the R, position of the 
peptidomimetic backbone, position these moieties in identical chemical space (Figure 2, A&C). The 
Class B inhibitor, 3B, which has a 3-thiophene at the R, position and a 1-methylbenzotriazole at the 
R, position, also positions the 3-thiophene and 1-methylbenzotriazole functionalities in the same 
chemical space utilized by the Class A inhibitors (Figure 2, E). The 3-thiophene in the R, position of 
all three inhibitors occupies the S,-S, sub-sites of the HKU4-CoV 3CL”” active site, the anilido and R.- 
functionality reside in the S,-S,’ sub-sites, and the R, benzotriazole occupies the S,-subsite, acting as 


the P, group of the substrate. 
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The X-ray structures reveal the structural determinants for the inhibitory preference for Class 


A and B compounds with backbones containing a 3-thiophene and 1-methylbenzotriazole. This 
backbone allows the inhibitor to anchor into the HKU4-CoV 3CL”” active site via the formation of three 
key hydrogen bonds. First, there is a direct hydrogen bond that exists between the 3-nitrogen atom 
of benzotriazole and the te/e-nitrogen of His166 in each enzyme-inhibitor complex, securing the 
location and position of the benzotriazole in the active site. A second hydrogen bond.between the 
carbonyl oxygen of the inhibitor backbone and the backbone nitrogen of Glu169 in,each structure 
further anchors the inhibitor in the HKU4-CoV 3CL*” active site. Finally, the thiophene ring is 
positioned within the HKU4-CoV 3CL”™ active site allowing the formation of a relatively uncommon 
hydrogen bond to the hydroxy! hydrogen of Tyr54.”"°° 

In each of the HKU4-CoV 3CL”’-inhibitor complexes, the R,-substituted aryl ring of both Class 
A and B inhibitors is positioned in the HKU4-CoV 3CL™ active site to take advantage of an edge-to- 
face pi-interaction with the catalytic histidine, His41 (Figure 2, Ay’C, E). This interaction is may be 
important for good inhibition of HKU4-CoV 3CL”° for two reasons. First, comparing 1A and 2A to 
22A, where the R,-substituted aromatic ring of 1A and 2A is absent, a greater than 200-fold increase 
in IC,,is observed for 22A. Second, Class B compounds having non-aromatic substituents instead of 
an R,-substituted aryl ring were not found to be good inhibitors of HKU4-CoV 3CL"” (vide supra). 
These data indicate that for low micromolar to sub-micromolar inhibition of HKU4-CoV 3CL””, an R,- 
substituted aryl ring may be required. .Furthermore, there is a conserved active site water molecule 
present in the X-ray crystal structure,of each HKU4-CoV 3CL°”-inhibitor complex that mediates a 
hydrogen-bond between the.backbone carbonyl oxygen of His41 and the amide R, nitrogen of the 
inhibitor (Figure 2, A, C,.E)., This finding illuminates the observed preference for an amide substituent 
at the R, position in both Class A and B inhibitors. 

Our attempt tosolve the X-ray crystal structure of 22A in complex with HKU4-CoV 3CL°” did 
not result in the structure of the HKU4-CoV 3CL*’-inhibitor complex, but instead resulted in the X-ray 
structure of the unliganded, or free, HKU4-CoV 3CL*° enzyme, which was found to crystallize in a 
different space group than the HKU4-CoV 3CL°’-inhibitor complexes (Figure 3, A). The unbound, or 
free, HKU4-CoV 3CL™’ enzyme was found to have noteworthy changes in the active site architecture 


pro 


in comparison to the X-ray crystal structures of the HKU4-CoV 3CL”’-inhibitor complexes. In the 
absence of an inhibitor, what appears to be a metal ion binds to the HKU4-CoV 3CL*’ active site by 
coordinating to the catalytic cysteine (Cys148), which is rotated 180° from the inhibitor bound 
structures, resulting in a 3.2 A positional change in the location of the sulfur atom with a distance of 
2.2 A between the sulfur of Cys148 and the metalatom. The metal is presumed to be a zinc cation, 


as it is present in water and likely to bind to a sulfur atom (also seen in PDB ID 2YNA), though it could 
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also be iron. The tele-nitrogen of His41 is shifted 2.6 A from its position in the HKU4-CoV 3CL””- 


1A-RFL complex X-ray crystal structure, allowing for the coordination of the Zn* atom, which is 
located 2.4 A away (Figure 3, A). Additionally, the positions of the Met25, Cys44, Ala46, Tyr54, and 
Gln192 residues all noticeably shift upon superposition of the free and inhibitor-bound forms of HKU4- 
CoV 3CL””. 

Comparing the most potent Class B inhibitor having the 1-methylbenzotriazole and 3- 
thiophene backbone, 3B, to the top HKU4-CoV 3CL”” inhibitor identified, 1A, elucidates the observed 
preference for Class A inhibitors over Class B inhibitors (Figure 3, C). The (R)-enantiomer of the 
Class B compound, 3B, was found in the HKU4-CoV 3CL”” active site, with the backbone of the 
inhibitor oriented similarly to the backbones of the Class A inhibitors 1A and 2A, utilizing the same 
hydrogen bonds; however, the (A)-R, tBu amide substituent of 3B,.which is absent in the Class A 
inhibitors, marginally rotates the thiophene away from Tyr54, increasing the distance to the hydrogen- 
bond donor Tyr54 from 3.9 to 4.5 A compared to 1A, and moving the thiophene out of hydrogen- 
bonding distance to Tyr54 (distance measured to heteroatoms). The R, tBu amide of 3B extends out 
toward the solvent exposed side of the active site, inducing a structural rearrangement of HKU4-CoV 
3CL”° where GIn192, which encloses 1A and 2A in the active site, is moved 2.7 A to accommodate 
the steric bulk of the R,substituent of 3B. The combination of these factors may be attributed for the 
observed 5-fold increase in IC,,when comparing 1A and 2A to 3B and likely account for the reduced 
inhibitory activity of Class B compounds relative to Class A compounds. 

The Class B peptidomimetic compounds were synthesized and tested for HKU4-CoV 3CL”° 
inhibition as racemates, which may lead to significant differences in IC,, values. An example of this 
can be seen by the comparison of (A)-13B to racemic- and (S)-13B (Figure 4). The enantiopure (A)- 
13B has an IC,, of 9.3 + 0:5 uM while the 13B racemate has an IC,,of 11.1 + 0.6 uM and (S)-13B has 
a 50% reduction in inhibition relative to the racemate, with only 44% inhibition of HKU4-CL”° at 100 
uM.. Interestingly, (R)-3B was found in the active site of HKU4-CoV 3CL”° in the X-ray crystal 
structure (Figure 2, E and F), suggesting an (A)-stereochemical of the inhibitor for good inhibition of 
HKU4-CoV 3CL™, despite the nature of the R, and R, heterocyclic groups. 

The X-ray structures of HKU4-CoV 3CL”° in complex with inhibitors provide vital insights into 
the types of compounds that can be developed as therapeutics against lineage C Betacoronaviruses. 
A sequence alignment of HKU4-CoV, HKU5-CoV, and MERS-CoV 3CL"”, all of which are lineage C 
Betacoronaviruses, show that HKU4-CoV and HKU5-CoV are 81.0% and 82.7% identical to MERS- 
CoV 3CL°’, and HKU4-CoV is 83.7% identical to HKU5-CoV 3CL”° (Figure 5). Moreover, the 
pertinent HKU4-CoV 3CL”” active site residues (Met25, His41, Cys44, Ala46, Tyr54, Cys148, His166, 
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Glu169, and Gln192, Figures 3 and 4) identified in the X-ray crystal structures that line the active 


site and interact with the peptidomimetic inhibitors are all conserved in HKU5-CoV and MERS-CoV 
3CL”"’. These data indicate that the active site architectures among HKU4-CoV, HKU5-CoV, and 
MERS-CoV 3CL°° may be similar enough to design broad-spectrum antiviral therapeutics against 
lineage C Betacoronaviruses and have implications for the structure and inhibition of lineage C 
Betacoronaviruses yet to emerge. To test this hypothesis, we tested whether our lead HKU4-CoV 
3CL*” inhibitor, compound 1A, would also have inhibition of HKU5-CoV 3CL”’. We found that 
compound 1A is a good inhibitor of HKU5-CoV 3CL™, having 91% inhibition at 100 uM and a sub- 
micromolar IC,, value of 0.54 + 0.03 uM which is comparable to the IC, value of 0.33 + 0.02 uM 
against HKU4-3CL™. Based on these data, we predict compound 1A will show inhibition of MERS- 
CoV 3CL”” as well. 


3. Conclusions 

In conclusion, 43 of the 230 peptidomimetics were found to inhibit HKU4-CoV 3CL°”, 19 of 
which belonged to the Class A peptidomimetic series and 24 of which belonged to the Class B 
peptidomimetic series. Overall, the Class A _peptidomimetics were found to be better inhibitors 
relative to the Class B compounds, and two of the.Class A compounds were determined to have sub- 
micromolar inhibitory potency against HKU4-CoV 3CL”", 1A and 2A. Analysis of the inhibition data 
from the 230 peptidomimetics resulted»in the identification of a favored inhibitor backbone consisting 
of 3-thiophene and 1-methylbenzotriazole functionalities. To evaluate the relative contribution of the 
favored inhibitor backbone to the total HKU4-CoV 3CL*® inhibition observed for the molecule, we 
synthesized the sub-structural components of the lead inhibitor, 1A, and tested them for inhibition of 
HKU4-CoV 3CL*". — The simple compound, 22A, which represents the R, and R, inhibitor 
functionalities and consists of the 3-thiophene and 1-methylbenzotriazole of the favored backbone, 
was found to inhibit HKU4-CoV 3CL*” even in the absence of the anilido and R, components of the 
inhibitor. We then determined the X-ray crystal structures of the three most potent inhibitors 
containing this favored backbone, 1A, 2A, 3B, in complex with HKU4-CoV 3CL™. These X-ray 
crystal structures prove that the three Class A and B inhibitors bind in the HKU4-CoV 3CL”™ active 
site, utilizing the same set of three hydrogen-bonding interactions between the favored backbone 
backbone and the HKU4-CoV 3CL*° active site residues, serving to anchor compounds with this 
backbone into the active site of the enzyme. Our attempt to solve the X-ray structure of the privileged 
backbone compound, 22A, in complex with HKU4-CoV 3CL”° resulted in the structure of unbound 
HKU4-CoV 3CL”, which is presumably as a consequence of weak inhibition by 22A (28% at 100 


12 
uM). Superposition of the unbound and inhibitor-bound forms of HKU4-CoV 3CL”° revealed 


several subtle, but important changes in the active site architecture upon inhibitor binding. 
Furthermore, sequence alignment of HKU4-CoV, HKU5-CoV, and MERS-CoV 3CL”° show that the 
residues involved in inhibitor binding are conserved, indicating that identification of broad spectrum 
3CL”™ inhibitors of lineage C Betacoronaviruses is possible. In support of this, we found,our most 
active HKU4-CoV 3CL” inhibitor to have inhibition of HKU5-CoV 3CL”” in the sub-micromolar range 
(IC,,= 0.54 + 0.03 uM). These findings, and the structural details underlying the potent inhibition of 
HKU4-CoV 3CL°° by compounds containing 3-thiophene and 1-methylbenzotriazole backbones, can 
be used in the future to design compounds with the ability to more fully exploit the HKU4-CoV 3CL”° 
active site, leading ultimately to the development of more highly potent and selective inhibitors 


against human pathogens such as SARS and MERS. 


4. Experimental 
4.1. Expression and purification of HKU4-CoV 3CL”° 

The gene encoding the 3CL”° of HKU4-CoV (residues 3292-3597 in the Tylonycteris bat-CoV 
polyprotein)’ was codon optimized for expression in E. coli and cloned into pET-11a expression 
vector with an N-terminal (His),-tag followed by nsp4-/5 auto-cleavage site by BioBasic Inc. This 
construct results in the expression of HKU4-CoV 3CL”™ without an N-terminal or C-terminal extension. 
E. coli BL21(DE3) cells, transformed with pET11a-HKU4 3CL”° plasmid were grown in Super LB 
media (3 g potassium phosphate monobasic, 6 g sodium phosphate dibasic, 20 g tryptone, 5 g yeast 
extracts, 5 g sodium in 1 L.water,; pH 7.2 adjusted with 1 M NaOH) in the presence of 1 mL 100 
mg/mL carbenicillin, 25 mb,8% lactose, 10 mL 60% glycerol, and 5 mL of 10% glucose per 1 L of 
expression culture for 24 hours at 25 °C. The cells were harvested by centrifugation (8,400 g for 20 
min) to yield 15.5 g/L of cells. The cell pellet was then re-suspended in 5 mL of Buffer A/1 g cell 
pellet (50 mM Tris pH 7.5, 0.2 M ammonium sulfate, 0.05 mM EDTA, 5 mM BME) containing 1 mg/mL 
lysozyme. After the cells were homogenized, they were lysed via sonication for 10 minutes with 10 s 
pulses at 50% amplitude using a Branson Digital Sonifier. The cell lysate was clarified by pelleting 
the cell debris via centrifugation (28,960 9g, 4 °C, 20 minutes) and loaded onto a 30 mL Phenyl 
Sepharose 6 Fast Flow HiSub column equilibrated with Buffer A. Protein was eluted with a gradient 
to 100% Buffer B (20 mM Tris pH 7.5, 0.05 mM EDTA, 5 mM BME) over five column volumes (150 


pro 


mL) collecting 5 mL fractions. Fractions containing HKU4 3CL"° were pooled and loaded onto a 60 
mL DEAE Sepharose Fast Flow column equilibrated with Buffer B. Protein was eluted with a gradient 
to 50% Buffer C (50 mM Tris pH 7.5, 1 M sodium chloride, 0.05 mM EDTA, 5 mM BME, 10% glycerol) 


over five column volumes (300 mL) collecting 5 mL fractions. Fractions containing pure HKU4-CoV 
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3CL” were pooled, dialized into storage buffer (25 mM HEPES, pH 7.5, 2.5 mM DTT, 10% 


glycerol), and concentrated. The protein was then aliquoted into 1.5 mL push-cap Eppendorf tubes, 
flash frozen in liquid nitrogen, and stored at -80 °C in a freezer until further use. 


4.2. IC,, Determination of inhibitors against HKU4-CoV 3CL™ at 25 °C and 37 °C: 

Each of the 230 dipeptide-like inhibitors were first screened for inhibition of HKU4 3CL”° at a 
concentration of 100 uM in duplicate assays containing the following assay buffer (50 mM HEPES, 
0.1 mg/mL BSA, 0.01% TritonX-100, 1 mM DTT). The assays were carried out in Costar 3694 
EIA/RIA 96-Well Half Area, Flat Bottom, Black Polystyrene plates from Corning Incorporated. 1 uL of 
100X inhibitor stock in DMSO was added to 79 uL of enzyme in assay buffer and the enzyme-inhibitor 
mixture was incubated for 10 minutes. The reaction was initiated by the addition of 20 uL of 10 uM 
UIVT3 substrate, a custom synthesized Forster resonance energy transfer substrate peptide with the 
following sequence: _ HilyteFluor™488-ESARLQSGLRKAK-QXL520™-NH,, producing _ final 
concentrations of 100 nM and 100 uM for the 3CL”° enzyme and UIVTS3 substrate, respectively. The 
fluorescence intensity of the reaction was then measured over time as relative fluorescence units 
(RFU,) for a period of 10 minutes, using an excitation wavelength of 485 and bandwidth of 20 nm and 
monitoring emission at 528 and bandwidth of. 20 nm using a BioTek Synergy H1 multimode 
microplate reader. 

The inhibition of HKU4-CoV 3CL™ by inhibitor compounds was monitored by following the 
change in RFUs over time, using the initial slope of the progress curve to determine the initial rate 


(V). The percent inhibition of the 3CL°° enzymes was determined using the following equation: 


Inhibited 3CLpro RFU/s — Background RFU/s 


ee 0 
Uninhibited 3CLpro RFU/s — Background RFU/s 


% Inhibition = E 


Full IC,,data were acquired for the compounds that showed greater than 50% inhibition of 
HKU4-CoV,.3CL”” at 100 uM of inhibitor compound. The IC,, values were determined at ambient 
temperature from 100 uL assays performed in triplicate in the following buffer: 50 mM HEPES, 0.1 
mg/mL BSA, 0.01% TritonX-100, 1 mM DTT. Kinetic assays were carried out in Costar 3694 EIA/RIA 
96-Well Half Area, Flat Bottom, Black Polystyrene plates from Corning Incorporated. Each inhibitor 
was tested at concentrations of 0.313, 0.652, 1.25, 2.5, 5.0, 10.0, 20.0, 40.0, 60.0, 80.0, 100.0, and 
120.0 uM; 1 uL of 100X inhibitor stock in DMSO was added to 79 uL of enzyme in assay buffer and 
the enzyme-inhibitor mixture was incubated for 10 minutes. The reaction was initiated by the addition 
of 20 uL of 10 uM UIVTS substrate, producing final concentrations of 100 nM and 2 uM for the 3CL”° 
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enzyme and UIVTS substrate, respectively. The fluorescence intensity of the reaction was then 


measured over time as RFU, for a period of 20 minutes, using an excitation wavelength of 485 and 
bandwidth of 20 nm and monitoring emission at 528 and bandwidth of 20 nm using a BioTek Synergy 
H1 multimode microplate reader. 

The percent inhibition of the 3CL"’ enzymes was then plotted as a function of inhibitor 
concentration. The SigmaPlot Enzyme Kinetics Wizard was used to fit the triplicate percent inhibition 
data and associated standard error to a non-linear Michaelis-Menten type regression model and 


determine the IC,, for each enzyme using the following equation: 


%Imax * [Inhibitor] 
% Inhibiti = 
SEU =e Tibor 
where %l,,.,is the percent maximum inhibition of 3CL™ and the error in IC,, values was determined as 


the error in the fitted parameter. 


4.3. Crystallization and X-ray structure determination of HKU4 3C-like protease in complex 
with inhibitors: 

The HKU4 3CL”’-inhibitor complexes. were co-crystallized from four different crystallization 
solutions. Briefly, the hanging-drop, vapor-diffusion method was used for crystallization by setting up 
drops and adding 1 uL of purified, HKU4-CoV 3CL™ (2.5 mg/mL, 75 uM), that had been incubated for 
three hours with a 3 molar excess of the appropriate inhibitor, and 1 uL of reservoir solution. For 1A: 
15% PEG-3350, 2% tacsimate, 5% isopropanol, 0.1 M imidazole pH 6; for 3B: 20% PEG-3350, 0.2 M 
ammonium acetate, 0.1,.M Bis-Tris pH 5.5; for 2A: 18% PEG-3350, 5% tacsimate, 0.2 M ammonium 
sulfate, 0.1 M Bis-Tris pH5.5; for 22A: 10% PEG-3350, 2% tacsimate, 5% isopropanol, 0.1 M 
imidazole pH»6.5» Protein crystals appeared between 14 hours and 7 days after setting up 
crystallization plates. Crystals were harvested with a nylon loop, which was then swiped through the 
same mother-liquor solution supplemented with 15% MPD. The crystals were cooled by plunging into 
liquid nitrogen and stored in shipping dewars containing liquid nitrogen until X-ray data collection at 
an available synchrotron could be performed. 

All diffraction data were collected at 100 K at the Life Sciences Collaborative Access Team 
(LS-CAT) at the Advanced Photon Source (APS) at Argonne National Laboratories. Crystals were 
transferred from shipping dewars into automated dewars and then and mounted robotically on a 
goniostat while under a stream of N,. X-ray data sets of HKU4 3CL”-inhibitor complexes were 


collected on a Rayonix 225 HE detector at a wavelength of 0.98 A. X-ray data were processed and 
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scaled using the program HLK2000, crystals of HKU4-CoV 3CL°” complexes with the inhibitors ‘a 
2A, and 3B belonged to the space group P2,2,2, and HKU4-CoV 3CL™ crystallized as a dimer in the 
asymmetric unit. Co-crystallization of HKU4-CoV 3CL™ with 22A did not result in an HKU4-CoV 
3CL””:inhibitor co-complex, but instead yielded the apo form of HKU4-CoV 3CL”", which crystallized 
in a different space group, P3,21 with one dimer per asymmetric unit. 

The initial phases for HKU4-CoV 3CL°° in complex with 1A, 2A, and 3B were determined by 
molecular replacement using Phaser-MR (simple interface) in the Phenix suite and the protein 
database PDB file 2YNA.“ The inhibitors were built and optimized from their SMILES strings using 
eLBOW and fit into the density using LigandFit in the Phenix suite and the program COOT. 
Structures were refined using Phenix. Water molecules were added manually to the 2F-F density 
peaks that were greater than 3.0 and iterative rounds of refinement/were carried out until R_.,,and 
gee 


using Phenix and COOT and the figures generated using the program PyMol. 


work 


reached their lowest values. The electron density maps presented in the figures were calculated 


4.4. General chemistry methods 

All reagents and chemicals were purchased from SigmaAldrich or Acros Organics and used 
without further purification. Anhydrous solvents were purchased or purified by passage through a 
solvent column composed of activated alumina and a supported copper redox catalyst. Analytical 
thin-layer chromatography was performed using Sorbent Technologies Glass-Backed Silica Gel HL 
TLC plates w/UV254. Flash chromatography was performed with Sorbent Technologies 200-400 
mesh silica gel. Nuclear magnetic resonance (NMR) data was acquired using either a Varian 
Inova300, Bruker ARX400, or Bruker AV-III-800 spectrometers at 300, 400, or 800 MHz for 'H or 75, 
100, or 200 MHz for*G. All compounds were purified by flash chromatography to >90% purity by 'H 
NMR. 


2-(1H-benzofaj[1 ,2,3]triazol-1-yl)-N-(thiophen-3-ylmethyl)acetamide (22A). 

To 2-(1H-benzo[a][1,2,3]triazol-1-yl)acetic acid (1.5 eq.) in anhydrous dichloromethane (0.2 M) 
at 0 °C under nitrogen atmosphere was added EDC (3.0 eq.) and HOBT (3.0 eq.). The mixture was 
allowed to stir for five minutes before the addition of DIEA (13.0 eq.). The mixture was allowed to stir 
for an additional 20 minutes before the addition of thiophen-3-ylmethanamine (1.0 eq.). The reaction 
mixture was allowed to warm to ambient temperature and stirred under nitrogen atmosphere for 16 
hours. The reaction was quenched by addition of saturated potassium bicarbonate and the aqueous 
layer was extracted with ethyl acetate (3X). The combined organic layers were washed with water 


(1X) and brine (1X), dried over sodium sulfate, and concentrated. The crude residue was 


16 
recrystallized from hot ethanol to provide the product as a beautiful white, crystalline solid in 30% 


yield. ‘'H NMR (800 MHz, CD,OD) (ppm): 8.03 (1H, d, J = 8.37), 7.72 (1H, d, J = 8.37), 7.59 (1H, dd, 
J = 8.37, 0.96), 7.48 (1H, dd, J = 8.37, 0.96), 7.40 (1H, dd, J = 4.94, 2.97), 7.29 (1H, m), 7.08 (1H, dd, 
J = 4.93, 1.30), 5.51 (2H, s), 4.46 (2H, s). “C NMR (200 MHz, CD,OD) (ppm): 166.47, 145.30, 
138.67, 133.83, 127.63, 126.92, 125.81, 124.28, 121.90, 118.60, 110.11, 49.87, 38.19. ESI-MS(-+): 
273 [M + 1], 295 [M + 23]. 


N-(4-aminophenyl)thiophene-2-carboxamide (23A). 

Thiophene-2-carboxylic acid (1.0 eq.) and HATU (1.4 eq.) were added to anhydrous 
dichloromethane (0.2 M) at room temperature followed by drop-wise addition of DIEA (3.0 eq.) under 
nitrogen atmosphere. The mixture was allowed to stir at ambient temperature for 5 minutes before 
benzene-1,4-diamine was added in one portion. The reaction mixture was allowed to stir for 14 hours 
before it was quenched with saturated potassium bicarbonate: The aqueous layer was extracted with 
ethyl acetate (8X) and the combined organic extracts were washed with water (1X) and brine (1X), 
dried over sodium sulfate, and concentrated. The crude yellow residue was purified by flash 
chromatography using a gradient of 20% to 70% ethyl acetate in hexanes to give the final product in 
50% yield. This reaction was performed from mg to g scale with no detrimental effect on yield. 'H 
NMR (800 MHz, CD,OD) (ppm): 7.84 (1H, d, J = 3.74), 7.67 (1H, dd, J = 4.97, 1.08), 7.37 (2H, d, J = 
8.56), 7.15 (1H, dd, J = 4.97, 3.74).“ °C NMR (200 MHz, CD,OD) (ppm): 161.26, 144.66, 139.63, 
130.56, 128.71, 128.31, 127.49, 122.85. ESI-MS(+): 219 [M + 1], 241 [M + 23]. 


N-(4-((thiophen-3-ylmethyl)amino)phenyl)thiophene-2-carboxamide (24A). 

23A (1.0 eq.) and thiophene-3-carbaldehyde (1.0 eq.) were dissolved in anhydrous 1,2-DCE 
(0.2 M) at ambient temperature under nitrogen atmosphere. Sodium triacetoxyborohydride (1.6 eq.) 
and acetic acid (1.0°eq.) were then added in one portion. The reaction mixture was allowed to stir for 
30 minutes before the reaction was quenched with 1 N sodium hydroxide. The aqueous layer was 
extracted with ethyl acetate (3X) and the combined organic extracts were washed with water (1X) and 
brine (1X), dried over sodium sulfate, and concentrated. The crude yellow residue was purified by 
flash chromatography using a gradient of 20% to 70% ethyl acetate in hexanes to give the final 
product in 50-80% yield. 'H NMR (800 MHz, CD,OD) (ppm): 7.85 (1H, m), 7.69 (1H, m), 7.36 (3H, 
m), 7.26 (1H, m), 7.17 (1H, d, J = 5.15), 7.12 (1H, d, J = 5.15), 6.70 (2H, m), 4.34-4.35 (2H, rotomeric 
peaks). “C NMR (200 MHz, CD,OD) (ppm): 172.76, 146.24, 141.08, 130.47, 128.20, 127.65, 
127.43, 126.82, 125.27, 122.87, 120.85, 112.81, 43.08). ESI-MS(+): 315 [M + 1], 337 [M + 23]. 
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7. Figure Legends 


Figure 1. Genome and proteome organization of HKU4-,-HKU5-, and MERS-CoV non- 
structural proteins highlighting the PLpro and 3CL-pro cleavage sites. 


Figure 2. X-ray crystal structures of 1A, 2A and 3B in complex with HKU4-CoV 3CL”°. 
Inhibitors are colored according to atom type and shown in ball and stick representation. Wall-eye 
stereo-view is shown in panels A, C, and E. Water molecules are shown as red spheres. Hydrogen 
bonds are shown as gray dashes with distances between heteroatoms labeled (A). Interatomic 
distances (A) are displayed using a gray arrow.»Electron density omit maps (F,— F.) are shown in 
grey mesh and are contoured to +3.0 around the inhibitor only. The binding orientation of each 
inhibitor is the same in the active site of each monomer of the dimer in the asymmetric unit of HKU4- 
CoV 3CL*”, therefore only one active site is shown for clarity. (A) HKU4-CoV 3CL™ (pale yellow) in 
complex with 1A (sky blue), PDB 4YOI, (B) 1A electron density omit map, (C) HKU4-CoV 3CL”° 
(wheat) in complex with 2A (pale,cyan); PDB 4YOu, (D) 2A electron density omit map, (E) HKU4-CoV 
3CL” (light orange) in complex.with 3B (slate), PDB 4YOG, (F) 3B electron density omit map 
contoured at +3.0 (blue) and+2.0 (grey). 


Figure 3. (A) Superposition of unbound HKU4-CoV 3CL”” (gray, PDB 4YO9) and HKU4-CoV 
3CL”” co-crystallized with inhibitor 1A (where HKU4-CoV 3CL”° is shown in ribbon representation in 
light yellow and 1A is shown in sky blue and colored by element, PDB 4YOl). The Zn atom in the 
unbound HKU4-CoV 3CL”” is represented as a gray sphere. (B) Superposition of HKU4-CoV 3CL”” 
co-crystallized with 1A (where HKU4-CoV 3CL”” is shown in ribbon representation in light yellow and 
1A is shown,in sky blue and colored by element, PDB 4YOI) and HKU4-CoV 3CL”° co-crystallized 
with 2A (where HKU4-CoV 3CL”” is shown as cartoon in wheat and 2A is shown in pale cyan and 
colored by element, PDB 4YOJ). (C) Superposition of HKU4-CoV 3CL”° co-crystallized with 1A 
(where HKU4-CoV 3CL” is shown in ribbon representation in light yellow and 1A is shown in sky blue 
and colored by element, PDB 4YOI) and HKU4-CoV 3CL”” co-crystallized with 3B (where HKU4-CoV 
3CL™ is shown as cartoon in light orange and 3B is shown in slate and colored by element, PDB 
4YOG). Wall-eye stereo-view is shown in each panel. 


Figure 4. Effect of 13B stereocenter configuration on HKU4-CoV 3CL”” inhibition. Where % 
inhibition is determined at 100 uM inhibition concentration, n/t = not tested. 


Figure 5. Sequence alignment of HKU4-CoV, HKU5-CoV, and MERS-CoV 3CL”° where the 
top row indicates the secondary structure of HKU4-3CLpro at a particular residue position, conserved 
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577 residues among all three 3CL’’s are shaded in red, and residues that have similar steric or 
578 electronic properties are shown in red text with no shading. Non-conserved residues are displayed in 
579 black text. The pertinent active site residues that border and interact with the peptidomimetic 
580 inhibitors (Met25, His41, Cys44, Ala46, Tyr54, Cys148, His166, Glu169, and Gln192) are indicated by 
581 blue arrows. 
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590 Table 1a. Class A peptidomimetic inhibitors. 
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